Determining Intercoder Agreement for a Collocation Identification Task
Abstract
In this paper, we describe an alternative to the kappa statistic for measuring intercoder agreement. We present a model based on the assumption that the observed surface agreement can be divided into (unknown amounts of) true agreement and chance agreement. This model leads to confidence interval estimates for the proportion of true agreement, which turn out to be comparable to confidence intervals for the kappa value. Thus we arrive at a meaningful alternative to the kappa statistic. We apply our approach to measuring intercoder agreement in a collocation annotation task, where human annotators were asked to classify PP-verb combinations extracted from a German text corpus as collocational versus non-collocational. Such a manual classification is essential for the evaluation of computational collocation extraction tools.
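For orientation, the kappa statistic that the abstract uses as its point of comparison can be sketched as follows. This is a minimal sketch of the standard Cohen's kappa for two coders, not the true-agreement model proposed in the paper, whose details the abstract does not give. The function name `cohen_kappa` and the labels `"colloc"` / `"non"` are hypothetical, chosen only for illustration.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Standard Cohen's kappa for two coders labelling the same items.

    Illustrative baseline only; this is NOT the paper's alternative model.
    """
    if len(labels_a) != len(labels_b) or len(labels_a) == 0:
        raise ValueError("both coders must label the same non-empty item list")

    n = len(labels_a)

    # Observed (surface) agreement: share of items with identical labels.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n

    # Chance agreement expected from each coder's marginal label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)

    if expected == 1.0:
        raise ValueError("kappa is undefined when chance agreement equals 1")

    # Kappa rescales the agreement that exceeds chance to the range up to 1.
    return (observed - expected) / (1.0 - expected)


# Hypothetical annotations of four PP-verb candidates by two coders.
coder_1 = ["colloc", "colloc", "non", "non"]
coder_2 = ["colloc", "non", "non", "non"]
kappa = cohen_kappa(coder_1, coder_2)
```

In this toy example the coders agree on 3 of 4 items (observed agreement 0.75), while their marginal frequencies give a chance agreement of 0.5, so kappa comes out at 0.5.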
Similar resources
The Impact of L2 Semantic Tasks (L2 Collocation versus L2 Definition) on Iranian Intermediate EFL Learners’ Vocabulary Achievement
This study investigated the effect of teaching L2 semantic tasks (collocation vs. definition) on the vocabulary achievement of Iranian intermediate EFL learners. To this end, 60 intermediate-level students at the Simin Institute were selected from a total of 100 participants based on their performance on the Oxford Placement Test. After ensuring the criterion of homogeneit...
Identification and Analysis of Critical Activities of Firefighting Department for Structural Fire Scenarios Using Task and Training Requirements Analysis (TTRAM)
Introduction: The increase in civil incidents, including residential fires, is a consequence of population growth and urban development. Residential fire is one of the most important scenarios requiring a fast response. Fire response operations involve various serious risks for response team members. Therefore, the present study aims to determine the critical tasks of fire operation r...
Automatic Identification of Lexical Units
A lexical unit is a word or a collocation. Extracting lexical knowledge is an essential and difficult task in NLP. Methods for extracting lexical units are discussed. We present a method for identifying lexical boundaries. The need for large training corpora is also discussed. The advantage of identification of lexical boundaries within a text over traditional window m...
The Effects of Collaborative and Individual Output Tasks on Learning English Collocations
One of the most problematic areas in foreign language learning is collocation. It is often seen as arbitrary and an overwhelming obstacle to the achievement of nativelike fluency. Current second language (L2) instruction research has encouraged the use of collaborative output tasks in L2 classrooms. This study examined the effects of two types of output tasks (editing and cloze) on the learni...
Parsing and MWE Detection: Fips at the PARSEME Shared Task
Identifying multiword expressions (MWEs) in a sentence in order to ensure their proper processing in subsequent applications, like machine translation, and performing the syntactic analysis of the sentence are interrelated processes. In our approach, priority is given to parsing alternatives involving collocations, and hence collocational information helps the parser through the maze of alterna...